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Abstract 

To better understand the transcriptional regulation of high molecular weight glutenin subunit (HMW-GS) expression, we 
isolated four Glu-lBx promoters from six wheat cultivars exhibiting diverse protein expression levels. The activities of the 
diverse Clu-lBx promoters were tested and compared with p-glucuronidase {GUS) reporter fusions. Although all the full- 
length Glu-lBx promoters showed endosperm-specific activities, the strongest GUS activity was observed with the lBx7°^ 
promoter in both transient expression assays and stable transgenic rice lines. A 43 bp insertion in the 18x7°^ promoter, 
which is absent in the lBx7 promoter, led to enhanced expression. Analysis of promoter deletion constructs confirmed that 
a 185 bp MITE (miniature inverted-repeat transposable element) in the 76x74 promoter had a weak positive effect on Clu- 
IBx expression, and a 54 bp deletion in the 76x73 promoter reduced endosperm-specific activity. To investigate the effect of 
the 43 bp insertion in the 76x7°^ promoter, a functional marker was developed to screen 505 Chinese varieties and 160 
European varieties, and only 1 Bx7-type varieties harboring the 43 bp insertion in their promoters showed similar 
overexpression patterns. Hence, the 76x7°^ promoter should be important tool in crop genetic engineering as well as in 
molecular assisted breeding. 
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Introduction 

Hexaploid wheat (Triticum aestivum L.) is one of the most 
important human food sources. Its complex genetic background 
leads to great diversity in nutritional and processing qualities 
among cultivars. High molecular weight glutenin subunits (HMW- 
GSs) are the main grain storage proteins in the endosperms of 
wheat and related species [1,2]. Although HWM-GSs grain 
storage proteins account for only about 12% of the total protein 
[3], they play a key role in wheat gluten as the skeletal network 
that to a large extent determines its structure and formation [4]. 
The compositions and quantities of allelic variation in HMW-GS 
genes substantially affect the taste and appearance of dough 
products, such as Chinese noodles and European bread [5]. 
Therefore, improvement of flour quality based on superior HWM- 
GS alleles is necessary to meet changing consumer demands. 

Both qualitative and quantitative effects of HMW-GS subunits 
are important for flour quality [6,7]. In the process of breeding, 
high dough strength is used as a predictor of good-quality bread 
wheat; and overexpression of Glu-lBx7 by way of allele IBx?'^^ 
makes an important contribution to high dough strength in some 



cultivars [8,9]. Expression of HMW-GS is regulated by three 
major factors, which are at the genomic level (gene duplication), 
transcriptional level and translational level [10-13]. Transcrip- 
tional regulation driven by Glu-1 5'-upstream flanking regions 
might provide strategies for improving grain quality in wheat 
breeding programs [14], A number of crucial CM-acting elements 
from HMW-GS promoters of various wheat cultivars have been 
investigated and characterized, as these could affect tissue 
specificity or expression activity, including conservative endo- 
sperm-specific motifs, such as the GCN4 motif [15], the prolamin 
box [16], AACA/TA motif [17], RY repeat motif [18], and Skn-1 
[19,20], each of which is capable of exerting temporal expression 
[21,22]. However, the basis of transcriptional regulation of 
divergence caused by large insertion and deletion (InDel) 
alterations in HMW-GS promoter regions is still not clear. As 
reported earlier, a tandem 54 bp duplication, known as the 
"cereal-box" located at —400 bp in the IBx promoter may 
enhance endosperm-specific expression [1], suggesting this dupli- 
cated region might be a key region for control of gene expression 
[23,24]. There is a 185 bp MITE insertion in the promoters of 
lBxl4 and lBx20, but functional verification indicated that this 
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insertion liad little efTect on gene expression [25-27]. A 43 bp 
insertion found at —1000 bp in the IBx?'^^ promoter was 
significantly associated with the overexpression phenotype. It 
was speculated that the overexpression was brought about by gene 
duplication mediated by the insertion of a retroelement [13], and 
there was no further study concerning the 43 bp InDel eflfcc t on 
protein expression. Therefore, more experimental data are needed 
to clarify the effect of InDels in HMW-GS promoters. 

Highly active t-ndosperm-sperific promoters serve as an 
important genetic resource for high-quality and high-yield wheat 
breeding. Use of seed storage protein gene promoters is an 
attractive strategy for obtaining target gene products exclusively 
from crop kernels. A number of seed-specific promoters from 
barley, rice, maize and other species have been investigated 
functionally [15,21,28]. Transgenic crops with favorable gene 
stacking require different tissue-specific promoters from various 
cereals, as this is helpful to reduce homology-based transcriptional 
gene silencing [29,30]. HMW-GS promoters from wheat, 
although containing endosperm-specific motifs, may not be 
spatially controlled in the same way as in their original genetic 
backgrounds due to subtle differences in respective regulation 
systems [31]. Hence, further research of key motifs from tissue- 
specific promoters would boost applicability in genetic engineer- 
ing. 

Among hexaploid wheat HMW-GSs, Glu-1 Bx often shows the 
highest level of expression [32]. We therefore set out to analyse 
IBx promoter sequence characteristics to uncover the transcrip- 
tional regulation mechanism. Based on diverse protein expression 
levels in six wheat cultivars, we isolated four Glu-1 Bx promoters in 
approximately 2.2 kb of length and further validated their 
functions. By comparison with these upstream sequences, several 
large InDels such as a 43 bp InDel, a 54 bp duplication and a 
185 bp MITE resulted in major divergences among the four 
promoters, including the lBx7 promoter (Pro-lBx7), lBx7^^ 
promoter {Pro-lBx?'^'^), lBxl3 promoter {Pro-lBxl3) and IBxM 
promoter {Pro-lBxl4). The promoter sequence variation was 
shown to be an important factor causing differential expression in 
transient expression systems and in transgenic rice plant assays. 
Notably, Pro-lBx7^^ is a highly active endosperm-specific 
promoter that can be made available for crop improvement by 
transgenic methods. Moreover, we developed a new specific 
molecular marker in terms of the 43 bp insertion residing in the 
lBx7'^^ promoter, with which we screened 505 Chinese and 160 
European cultivars [33]. We found that this functional marker is 
significandy associated with lBx7 overexpression. Our results 
further showed that transcriptional regulation might be responsi- 
ble for IBx expression diversity to a larger extent than initially 
expected. 

Materials and Methods 

Plant materials 

Hexaploid wheat (Triticum aestivum L.) cultivars (cv.) Yanzhan 
1, Adas 66, Jimai 20, Xiaoyan 54, Yunmai 33 and Chinese Spring 
were grown in the field. Endosperm of Xiaoyan 54 was prepared 
for transient expression assays, and rice (Oryza sativa L. ssp 
japonica) cv. Kita-ake was used to produce stable transformants. 
Materials used for molecular marker screening included 505 
Chinese and 160 European cultivars [33]. 

SDS-PAGE and quantification of HMW-GSs 

Protein fractions were extracted from single wheat kernels using 
a previously reported HMW-GS extraction protocol [34]. 
Identical amounts of protein extracted from seeds of different 



varieties were separated by SDS-PAGE and visualized by 
Coomassie Blue staining as described by Zhang at al. [35]. 
Densitometric analyses of IBx subunits were carried out by 

Quantit)' One software (Bio-rad, USA). The value of the optical 
density multiplication area was used to quantify HMW-GS 
expression. 

Promoter isolation and cis-element prediction 

DNA extraction was p(;rformed as previously described [36]. 
Using a pair of specific primer sets, lBx2258F/R (Table SI), four 
fuU-length IBx promoters, ~2.3 kb in size, from six wheat 
varieties were isolated, gel purified and sequenced. Putative 
regulatory elements within the IBx promoter were predicted 
using the Plant Cis-acting Regulatory DNA elements (PLACE) 
database [37] combined with a previous report [38]. 

Construction of promoter-GUS chimeric genes and 
subsequent transformation 

Several full-length and truncated IBx promoters were obtained 
by PGR amplification with primers introducing DNA restriction 
enzyme sites for convenient subcloning (Table SI), and cloned into 
a modified vector PAHC25 [39] for transient expression, and then 
subcloned into binary vector pCAMBIA1391z, containing the 
reporter gene GUS under the control of different IBx promoters. 
In transient expression experiments, immature embryos harvested 
at 12-14 days post anthesis (DPA) were used for bombardment as 
described by Ortiz et al. [40]. Difierent IBx promoter-GUS 
constructs were tested in transient expression assay s as described 
previously [41]. For stable transformation, tiie binary xcctor 
constructs were frrst introduced into Agrobaclerium tumefaciens 
strain EHA105, and then rice transformation was carried out as 
described by Cho et al. [42]. Transgenic rice plants were selected 
on medium containing 50 mg L ' of hygromycin, and positive 
hnes were grown in the field for further analysis. 

PGR and Southern blot analyses of transformed rice 

plants 

Genomic DNA was isolated from leaf tissues of transformed rice 
plants as previously described [36]. PGR analysis for molecular 
identification of transgenic rice plants was performed using a set of 
specific primers for the IBx promoter and GUS gene (fisted in 
Table SI). 

For Southern blot analysis, genomic DNA (10 |J,g) from different 
transgenic lines were digested with Bamiil or Hindlll (New 
England Biolabs, USA). Digested DNA was separated by 
electrophoresis in 0.8% (w/v) agarose gels, and then transferred 
to Hybond-N"*" membranes (Amersham Biosciences, USA) and 
hybridized with a GUS gene fragment labeled with [a- P]dCTP 
as described previously [36]. 

Histochemical GUS assay 

Wheat endosperms undergoing transient expression and differ- 
ent tissues of T3 transgenic rice were used for histochemical GUS 
assays. GUS staining was performed as described by Kosugi et al. 
[43]. Images of stained samples were captured using an MZ16 
High-tech Stereomicroscope (Leica, Germany). Stained GUS 
spots were counted, and for statistical comparison, data of each 
sample was expressed as the mean number of blue spots per 
endosperm. 
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Quantification of expression of the GUS gene under 
control of IBx promoters in transgenic rice plants 

Total RNA was extracted from kernels of 3 independent 
positive T3 transgenic rice lines at 10-18 DPA. Transcriptional 
levels oi GUS in all stable transgenic Unas were quantified by 
quantitative real-time PGR (qRT-PCR) with a 7300 Real-time 
PGR system (Applied Biosystems, USA) using Power SYBR Green 
PGR Master Mix (Applied Biosystems, USA). Details of primer 
pairs used for qRT-PCR are given in Table SI. The specificity of 
the primer sets was assured by confirmation that the resulting 
products appeared as single peaks in real-time melting tempera- 
ture curves and as single fragments after separation by agarose gel 
electrophoresis. To confirm adequate amplification PGR efficiency 
was assessed using a sample dilution series as templates [44]. 
Amplification plots and predicted threshold cycle values were 
obtained from three independent biological rephcates with SDS 
software version 2.1 (Apphed Biosystems, USA). GUS gene 
expression levels were presented as fold-changes calculated using 
the comparative threshold cycle (CT) method as described [45] 
with rice GAPDH used as the internal control. 

Results 

Identification of HIVlW-GSs by SDS-PAGE 

HMW-GSs of six wheat cultivars were separated by SDS-PAGE 
(Figure lA). Their subunit compositions varied from each other 
(Table S2). Evidendy, expression levels of Glu-IA and Glu-ID are 
generally lower than Glu-IB. There were four allelic variants of 
IBx among the six cultivars, namely IBx?'^^, lBxl4, lBxl3 and 
lBx7. The protein level of 16x7°^ in Yunmai 33 was much 
higher than that of IBx in other cultivars, about 2.2-fold that of 
lBxl3 and lBx7 and 1.8-fold that of lBxl4 (Figure IB). 

Comparative analysis of upstream sequences of Glu-IBx 

Four types of 5' flanking sequences of IBx alleles were isolated 

with the specific primer pair, lBx2258F/R (Table SI and Figure 
SI). They were 2,294, 2,253, 2,18,5 and 2,433 bp in length for 
Pro-lBx?"'', Pro-lBx7, Pro-lBxl3, und Pro-lBxl 4, respectively. 
The four 5' proximal flanking regions contained five common 
motifs, including DOE recognition sites, bZIP recognition sites, 
MYB recognition sites, VPl recognition sites and basal promoter 
elements (Figure 2 and Table S3), which are conserved in 
promoters of gen(^s that (■n( <)d(^ most seed storage proteins [38]. 
In addition to sevc-ral single-base substitutions or small deletions, 
the presence of sequence insertions or deletions (InDels) consti- 
tuted the main differences among the four entire promoter regions 
(Figure S2). By comparison with the lBx7 promoter, the lBxl3 
promoter has a 54 bp deletion at —400 upstream from the start 

A f 

a "Roc 



codon (54 bp duplication position), the IBxM promoter contains 
a 185 bp MITE insertion at —874, consistent with previous 
reports [26,46], and the IBxl'^^ promoter possesses a 43 bp 
insertion at —1047, which is always associated with the 

overexpression phenotype [12]. 

The transient expression results for different IBx 
promoters 

We compared the expression efficiencies of the full-length 
promoters of lBx7, lBxl3 and lBx7'^'^ by means of transient 
expression assays in wheat endosperms (Figure S3), and GUS 
driven by the IBxl'^^ promoter exhibited much higher activity 
than when driven by the lBx7 and IBx 13 promoters (Figure 3A). 
Since the 43 bp InDel represents the difference between the lBx7 
and IBxl'^'^ promoter sequences, we speculated that the 43 bp 
insertion enhanced the endosperm-specific expression. In addition, 
the IBx 13 promoter activity was lower than that of the lBx7 
promoter, which lacks the 54 bp duphcation present in the lBxl3 
promoter. We further investigated the effect of the 185 bp MITE 
on lBxl4 expression. Two truncated lBxl4 promoters were fused 
to GUS, and transient expression results showed that GUS 
expression dri\en by a 1,192 bp Pro-lBxl4-D2 was slightiy 
higher than that dri\'en by the 873 bp Pro-lBxl4-Dl (Figure 3B), 
confirming that the MITE might positively but weakly affected 
transcription of lBxl4 [26]. 

Histochemical and quantitative assays in transgenic rice 

Chimc-ras were constructed using different IBx promoters fused 
to GUS and then transformed into rice. Through GUS staining 
assays in the T3 generation of stable transgenic plants, we detected 
CjUS activity driven by the three full-length IBx promoters only in 
the seeds and not in stems or leaves collected at 15 DPA. These 
results indicated that the promoters are endosperm-specific 
(Figure 4A). In contrast, GUS staining was observed in all tissues 
of transgenic rice carrying the Ubiquitin promoter-Gi/S construct 
(Figure 4A). Therefore, the full-length IBx promoters contained 
necessary cw-elements that specify endosperm-specific regulation 
in both wheat and rice. Consistent with the transient expression 
results, the full-length lBx7^^ promoter with the 43 bp insertion 
exhibited much higher GUS activity than either the full-length 
lBx7 or lBxl3 promoters (Figure 4A). Southern blot analysis 
confirmed that the transgenic rice Knes had single copies of the 
GUS gene (Figure S4); therefore the comparative results of 
promoter activities were convincing. 

We also investigated the activities of different truncated IBx 
promoters in transgenic rice. Except for the Pro-lBxl3-D 
promoter, other truncated promoters, including Pro-lBxl4-Dl/ 
D2 and Pro-lBx7-D, retained endosperm-specific expression 

B 



Glu-Bl 
locus ' 




Figure 1 . HMW-GSs from six common wheat cultivars. (A) SDS-PAGE profiles of HMW-GSs. 1 Bx7 (Chinese Spring), 1 Bxl 3 (Atlas 66 and Jimai 
20), 1Bx14 (Xiaoyan 54 and Yanzhan 1) and 1Bx7°^ (Yunmai 33). (B) Relative amounts of four IBx subunits estimated by densitometric analysis. 
doi:l 0.1 371/iournal.pone.01 05363.g001 
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Figure 2. Schematic structure of four 7£>r promoters. Regulatory elements are indicated by colored rectangles. InDels are labeled with hollow 
boxes and core sequences are listed in detail under the sketch map. The red underlined sequence shows the "cereal box" in the 54 bp duplication. 
The 8 bp bases in red at both ends indicate the target site duplication (TSD) of the MITE. Positions of upstream primers used for obtaining truncated 
promoters are indicated with black arrows. 
doi:1 0.1 371/journal.pone.01 05363.g002 



activity (Figure 4C). Absence of the 54 bp duplication led to loss of 
endosperm-specific expression controlled by the Pro-lBxl3-D 
promoter, thus indicating that the fragment removed from the 

lBxl3 promoter (—943 2162) contained necessary CM-ele- 

ments that restricted expression to the endosperm. Like transient 
expression results, GUS expression directed by Pro-lBxl4-D2 
harboring the 185 bp MITE was higher than that directed by Pro- 
lBxl4-Dl. 

To confirm the results of GUS staining in seeds of transgenic 
rice, we applied qRT-PCR to determine expression levels oiGUS. 
GUS expression detected in seeds during 10-18 DPA showed that 
expression driven by IBx promoters increased rapidly from 10 
DPA to 14 DPA, and reached a peak level at 14-16 DPA 
(Figure 4B and 4D). As the control, GUS expression driven by the 



Uhiquitin promoter maintained a relatively constant level through 
10 to 18 DPA (Figure 4B). The results of GUS expression at 14-16 
DPA were highly consistent with those of GUS activities based on 
histochemical staining (Figure 4A and 4C), confirming that the 
protein expression pattern was similar to the gene expression 
pattern at the mRNA level. 

Phylogenetic analysis of HMW-GS promoters 

To address the question of whether these InDels are present in 
other HMW-GS promoters, we analyzed promoters from 14 
different wheat HMW-GS genes [1,26,47] and identified the 
regulatory motifs related to endosperm-specific expression in the 
promoter regions about 1,200 bp upstream of the initiation codon. 
The numbers of regulatory motifs obviously differed among the 
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Figure 3. Schematic representation of constructs used for transient expression assays in wheat endosperms and GUS activities 
driven by full-length and truncated 7&ir promoters. (A) Effects of fulflength 1Bx7, lBxl3 and lBx7°^ promoters on transient expression. (B) 
Effect of the 185 bp MITE from the lBxl4 promoter on transient expression. **P<0.01 (student's f-test) indicates significant difference from others. 
doi:1 0.1 371/journal.pone.01 05363.g003 
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Figure 4. GUS staining of various tissues and quantitative analysis in developing endosperm of transgenic rice. (A) and (C) 

Histochemical analysis of GUS driven by different promoters in transgenic rice tissues collected at 1 5 DPA; (B) and (D) Relative expression levels of GUS 
in seeds from different transgenic lines during 10 to 18 DPA based on qRT-PCR. Rice GAPDH was used as the internal control. Values are shown as 
means ± s.d (standard deviation) of three independent experiments and three biological replicates. Colored lines at the top right corner represent 
different transgenic plants. 
doi:1 0.1 371/journal.pone.01 05363.g004 



different HMW-GS promoter sequences (Figure 5). In phyloge- 
netic analysis, all Glu-lBx promoters clustered together in one 
branch. The 185 bp MITE insertion was present in both the Glu- 
lBxl4 and Glu-lBx20 promoters. The 54 bp tandem duplication 
was absent in non-Glu-lBx promoters, but present in all Glu-lBx 
promoters except the Glu-lBxl3 promoter. Therefore, InDels 
contributed to the diversity in HMW-GS promoters, which is an 
important means for evolution of HMW-GS genes. These large 
fragment InDels can be used as a potential resource for creating 
new alleles. 



Development of a molecular marker for the 43 bp 
insertion and its distribution in natural populations 

Based on the 43 bp insertion sequence in the IBx?'''^ promoter, 
we developed a new molecular marker that differed from those 
previously reported [34]. Our marker can precisely identify the 
insertion in HWM-GS promoters among common wheat varieties. 
PGR amplification resulted in two kinds of bands that distinguish 
promoters with the 43 bp insertion (a 476 bp fragment) from those 
without (a 433 bp fragment) (Figure 6A). Among 505 Chinese and 
1 60 European accessions surveyed, we found 3 Chinese and 1 1 
European varieties with the 43 bp insertion when we used this 
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Sl<n-1 lil<e motif 
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Figure 5. Phylogenetic analysis of 14 HMW-GS promoters. Neighbor-Joining Tree of partial length (-1 ~ -1200 bp upstream of the start codon) 
sequences of Glu-1 from Triticum aestivum L and other wheat-related grass species. This work was done with the MEGA program (Version 5.2). InDels 
and conserved cis-elements are shown by markers in different colors. 
doi:1 0.1 371/journal.pone.01 05363.g005 
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marker. HMW-GS profiles of accessions containing the 476 bp 
marker were later obtained by SDS-PAGE (Figure 6B). The 
presence of particular IBx alleles was determined by densitometric 
analysis; 10 accessions (3 from China and 7 from Europe) 
exhibited a lBx7 overexpression phenotype relative to Chinese 
Spring used as a control (Figure 6B and Table S4). We also 
identified some types ot lBx6 and lBxl4 with the 43 bp insertion 
(Figure 6B and Table S4), and confirmed its presence by DNA 
sequencing. However, these two types of IBx did not show 
overexpression at the protein level. Therefore, the 43 bp insertion 
in promoters preferentially enhanced lBx7 expression although no 
obvious differences were found among the upstream regions of 
lBx6, 18x14 and IBx?"'^. 

Discussion 

HMW-GSs represent a set of important seed-storage proteins, 
and both their composition and quantity significantly affect wheat 
flour quality [7]. Since gene transcriptional regulation is the 
dominant means of control in production of proteins [48], we 
isolated four promoter sequences of Glu-lBx and investigated 
their effects on gene expression. Although the reporter gene driven 
by all the IBx promoters exhibited an endosperm-specific 
expression pattern, the IBx?^'^ promoter from cv. Yunmai 33 
produced a markedly stronger activity than other promoters. 
Previous studies showed that gene duplication was the cause of 



A 




Figure 6. Electrophoretic separation of PCR products from 7Bx 
promoters with or without the 43 bp insertion, and SDS-PAGE 
profiles of HiVIW-GS of wheat cultivars with the insertion. (A) 

PCR assays for IBx promoters on a 2% agarose gel. M: 100 bp DNA 
Ladder. Underlined accessions possess the 43 bp insertion. (B) SDS- 
PAGE assay of HIVIW-GS from different accessions containing the 43 bp 
insertion. Underlined accessions possess the 43 bp insertion. Down 
black arrows indicate lBx7 with 43 bp insertion in accessions from 
China and Europe. Chinese Spring, Yanzhan 1 and Samo were used as 
controls. The purple curve represents the relative amounts from 
different IBx subunits. 
doi:1 0.1 371/journal.pone.01 05363.g006 



1 Bx7 overexpression in some wheat cultivars [49] . The connection 
between the strong activity of the 75x7"^ promoter and high 
protein level produced by the lBx7''^ subunit clearly indicated 
that transcription regulation is also a factor in lBx7 overexpres- 
sion. We therefore concluded that multiple factors, including gene 
duplication and transcriptional regulation determine the expres- 
sion of lBx7. This work revealed a complex regulatory network of 
HMW-GS expression in wheat. 

The 43 bp insertion at -1047 bp is closely associated 
with high expression of 1Bx7°^ 

Large InDels in promoter regions often result in higher rates of 
transcriptional divergence [50]. In this study, we identified a 43 bp 
IiiDel, a 54 bp duplication and a 185 bp MITE in different Glu- 
IBx promoters, and they accounted for the main differences 
among IBx promoter sequences. These InDels affected the 
expression levels of the genes. The presence of the 43 bp insertion 
at position —1047 upstream of the start codon was shown to be 
closely associated with high expression levels of the lBx7'^^ 
subunit [12,13]. We verified that the 43 bp insertion can serve as a 
strong enhancer to improve expression of the gene by comparing 
the transcriptional activities between full-length lBx7 and IBx?'^^ 
promoters (Figure 3 A; Figure 4A and 4B). Since there are no 
known cii-elements in the 43 bp insertion, this insertion may 
facilitate evolutionary tuning of gene expression by affecting local 
chromatin structure and nucleosome positioning [50]. 

The 185 bp MITE insertion at -874 bp of lBxl4 might 
slightly affect transcription 

The 185 bp MITE insertion located at -874 bp in the IBxM 
promoter may be a remnant of an earlier transposition of a large 
element or of small, highly repeated elements [51]. In the present 
study, the lBxl4 promoter with or without the 185 bp MITE, did 
not produce a significantiy different activity in the transient system 
or in stable transgenic rice assays, suggesting that it might only 
slightly affect the transcriptional regulation (Figure 3B; Figure 4C 
and 4D). The 185 bp MITE exists in both hexaploid and 
tetraploid wheat, and may be linked to the polyploidization event 
affecting the constitutions and activities of the genomes of grass 
species [26]. 

A 54 bp cereal-box motif is necessary for endosperm- 
specific expression 

The tandem 54 bp duplication at position —400 contains the 
"cereal-box" implicated in seed-specific expression [1]. Our data 
demonstrated that the IBx 13 fuU-length promoter harboring one 
54 bp deletion retains endosperm-specific activity, but a lBxl3 
promoter truncated at —942 bp lacks endosperm-specificity 
accompanied by increased activity (Figure 4C). We speculate that 
the 54 bp deletion might complement essential cis-elements in the 
region —940 to —2000 bp of the lBxl3 promoter to effectively 
control gene endosperm-specific expression. Without the aid of the 
eis-elements in the region, only one 54 bp cereal-box motif may 
not be enough to restrict gene expression to endosperm. Based on 
phylogenetic analysis of HWM-GS promoters, only the lBxl3 
promoter and noii-7Sjc promoters contain a 54 bp deletion 
(Figure 5). This tandem 54 bp duphcation must have occurred 
before hexaploidization because it is also present in tetraploid 
wheat. Flanking-sequeiice divergence was also noted from 
extensive DNA sequencing analysis of a-gliadin genes [52]. The 
basis of HMW-GS evolution is repeated sequence events that lead 
to new alleles [53]. 
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A simple PCR marker was developed to target high 

expression of 1Bx7 and 1Bx7°^ 

Since previous 43 bp InDel marker covers a region of 
1.2~1.3 kb that also contains other InDels such as the 185 bp 
MITE and 54 bp duplication [34], a new specific marker based on 
the 43 bp insertion was developed and used eflFectively in two 
independent wheat populations combined with SDS-PAGE 
electrophoresis analysis to identify lBx7 overexpressing cultivars. 
Interestingly, the 43 bp insertion exists not only in the lBx7 
promoter but also in other IBx promoters such as those oi lBxl4 
and lBx6 (Figure 6B). Despite harboring the 43 bp insertion in 
the promoters, the lBxl4 and lBx6 subunits produce no 
significant increases in protein compared to subunit alleles without 
the insertion (Figure 6B). The likely reason is that a co-regulatory 
factor linking the 43 bp insertion to expression efficiency is present 
in the lBx7 alleles or regulation at the translational level might 
strongly influence the divergence in expression between lBx7 and 
non-lBx7 subunits. 

Putative additive effects of gene duplication and 
transcriptional regulation on 1Bx7 expression 

According to the literature, it is concluded that the lBx7 
overexpression phenotype is mediated by an LTR retroelement 
resulting in gene duplication along with the polyploidization event 
[13]. In the present study, we confirmed that a 43 bp insertion 
situated in the lBx7''^ promoter is capable of strengthening 
transcriptional activity markedly through transient expression and 
transgenic rice assays. By using molecular markers which can be 
used to indicate lBx7 gene duplication [13], we found that only 
the cultivar Yunmai 33 has both the 43 bp InDel and two lBx7 
copies (gene duplication), while other 9 cultivars with the 43 bp 
InDel have only one lBx7 copy (Table S4). Although the lBx7 
subunit of Yunmai 33 is the most abundant in this study 
(Figure 6B), other cultivars with the 43 bp InDel demonstrate 
higher lBx7 expression than the control, especially Chinese 
cultivars Enmai 4 and Chuanmai 10 (Figure 6B). So it can be 
inferred that both gene duplication and transcriptional regulation 
can lead to lBx7 overexpression, and their effects on lBx7 
expression can be accumulated. 

Endosperm is the storage tissue for starch and protein in cereal 
crops, which are the major sources of carbohydrates and proteins 
for humans. Improved yield and qualit)' of crops by genetic 
modification has huge potential, and some significant achieve- 
ments have already been accomplished [54]. Because continuous 
high expression of foreign genes in all tissues may cause 
detrimental effects in host plants [55], identification and applica- 
tion of strong endosperm-specific promoters will attract interest 
from breeders and biologists. In the current work, we identified a 
highly active lBx7^^ promoter that can enhance endosperm- 
specific gene expression at the transcriptional level, and it should 
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